A Fast Algorithm for Aperiodic Linear Stencil Computation using Fast Fourier Transforms
نویسندگان
چکیده
Stencil computations are widely used to simulate the change of state physical systems across a multidimensional grid over multiple timesteps. The state-of-the-art techniques in this area fall into three groups: cache-aware tiled looping algorithms, cache-oblivious divide-and-conquer trapezoidal and Krylov subspace methods. In paper, we present two efficient parallel algorithms for performing linear stencil computations. Current direct solvers domain computationally inefficient, methods require manual labor mathematical training. We solve these problems stencils by using DFT preconditioning on method achieve solver which is both fast general. Indeed, while all currently available solving general perform Θ ( NT ) work, where N size spatial T number timesteps, our o work. To best knowledge, give first that use Fourier transforms compute final data evolving initial many timesteps at once. Our handle periodic aperiodic boundary conditions, polynomially better performance bounds (i.e., computational complexity runtime) than other existing solutions. Initial experimental results show implementations evolve grids roughly 10 7 cells around 5 run orders magnitude faster problems, 1.3 × 8.5 problems. Code Repository: https://github.com/TEAlab/FFTStencils
منابع مشابه
Fast Computation of Partial Fourier Transforms
We introduce two efficient algorithms for computing the partial Fourier transforms in one and two dimensions. Our study is motivated by the wave extrapolation procedure in reflection seismology. In both algorithms, the main idea is to decompose the summation domain of into simpler components in a multiscale way. Existing fast algorithms are then applied to each component to obtain optimal compl...
متن کاملFast Fourier Transforms
29 O(b log(b)) operations (using standard multiplication). As there are O(b= log(b)) primes in total, the running time of this stage of the algorithm is O(b 2 L), even using the \grammar school" method of integer multiplication. At this stage of the algorithm we have obtained a vector of length L whose entries are integral linear combinations of powers of with coeecients bounded by M in absolut...
متن کاملDimensionless Fast Fourier Transforms
This paper shows that there are fast Fourier transform (FFT) algorithms that work, for a fixed number of points, independent of the dimension. Changing the dimension is achieved by relabeling the input and the output and changing the “twiddle factors.” An important consequence of this result, is that a program designed to compute the 1-dimensional Fourier transform can be easily modified to com...
متن کاملFourier Transforms and the Fast Fourier Transform ( FFT ) Algorithm
and the inverse Fourier transform is f (x) = 1 2π ∫ ∞ −∞ F(ω)e dω Recall that i = √−1 and eiθ = cos θ+ i sin θ. Think of it as a transformation into a different set of basis functions. The Fourier transform uses complex exponentials (sinusoids) of various frequencies as its basis functions. (Other transforms, such as Z, Laplace, Cosine, Wavelet, and Hartley, use different basis functions). A Fo...
متن کاملFast Computation of Voigt Functions via Fourier Transforms
This work presents a method of computing Voigt functions and their derivatives, to high accuracy, on a uniform grid. It is based on an adaptation of Fourier-transform based convolution. The relative error of the result decreases as the fourth power of the computational effort. Because of its use of highly vectorizable operations for its core, it can be implemented very efficiently in scripting ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: ACM Transactions on Parallel Computing
سال: 2023
ISSN: ['2329-4949', '2329-4957']
DOI: https://doi.org/10.1145/3606338